Model Selection

High-resolution image generation

# High-resolution image generation

FLUX.1 Dev GGUF

FLUX.1 [dev] is a rectified flow transformer with 12 billion parameters, capable of generating high-quality images based on text descriptions, providing powerful image generation capabilities for developers and creative professionals.

Text-to-Image English

Simpletuner Lora

simpletuner-lora is a text-to-image and image-to-image conversion model based on PEFT LoRA, derived from the FLUX.1-dev model.

Simpletuner Lora

LyCORIS adapter based on Stable Diffusion 3.5 Medium, specializing in photorealistic image generation

Image Generation

A LyCORIS adapter based on FLUX.1-dev, focusing on text-to-image and image-to-image tasks, supporting multiple resolution outputs.

Image Generation

Flux Lora Training

This is a standard PEFT LoRA derivative model based on FLUX.1-dev, focusing on text-to-image and image-to-image generation tasks.

Image Generation

Hidream5m Photo 1mp Prodigy

LyCORIS adapter based on HiDream-I1-Full, focusing on high-quality image generation

Image Generation

AuraFlow v0.3 is a fully open-source flow-based text-to-image generation model that supports multiple aspect ratios, with resolutions up to 1536 pixels.

terminusresearch

Standard PEFT LoRA model based on FLUX.1-dev, specializing in generating high-quality female character images

Image Generation

A PEFT LoRA model trained based on FLUX.1-dev, focused on text-to-image generation tasks, supporting specific artistic style conversion.

Image Generation

Dc Ae F32c32 Sana 1.1 Diffusers

DC-AE is a novel autoencoder architecture designed to accelerate high-resolution diffusion models. It maintains reconstruction quality at high spatial compression ratios through residual autoencoding and decoupled high-resolution adaptation techniques.

Image Generation

Dc Ae F32c32 Sana 1.1

DC-AE is a novel autoencoder architecture designed to accelerate high-resolution diffusion models, addressing reconstruction accuracy issues under high compression ratios

Image Generation

Sana 600M 1024px

Sana is an efficient text-to-image framework capable of generating images with resolutions up to 4096×4096, featuring rapid synthesis of high-resolution, high-quality images.

Text-to-Image Supports Multiple Languages

Efficient-Large-Model

Sana 1600M 1024px MultiLing

Sana is an efficient text-to-image framework capable of generating images with resolutions up to 4096×4096, supporting multilingual input.

Text-to-Image Supports Multiple Languages

Efficient-Large-Model

A text-to-image diffusion model fine-tuned based on FLUX.1-dev, specializing in GArt style image generation

Image Generation

Sana 1600M 1024px

Sana is an efficient text-to-image framework capable of generating images up to 4096×4096 resolution, deployable on laptop GPUs.

Image Generation Supports Multiple Languages

Efficient-Large-Model

Ebook Creative Cover Flux LoRA

A text-to-image model based on LoRA technology, specifically designed for generating e-book cover designs

Image Generation

Mlx Stable Diffusion 3.5 Large

MLX framework version optimized from Stable Diffusion 3.5 Large, specifically designed for Apple chip-optimized text-to-image generation models

Image Generation English

SD3.5 LoRA Futuristic Bzonze Colored

A LoRA fine-tuned model based on Stable Diffusion 3.5, specialized in generating images with a futuristic bronze color style.

Image Generation

Meissonic is a non-autoregressive masked image modeling text-to-image model capable of generating high-resolution images, specifically designed to run on consumer-grade GPUs.

Text-to-Image English

Cogview3 Plus 3B

CogView3-Plus-3B is the DiT version of CogView3, supporting text-to-image generation from 512 to 2048 pixels.

Text-to-Image English

Illustrious Xl V01 Sdxl

An early release version based on Stable Diffusion XL, focusing on generating anime-style illustrations through text-to-image modeling

Image Generation English

Mlx Stable Diffusion 3 Medium

MLX implementation of Stable Diffusion 3 Medium, focused on text-to-image generation

Image Generation English

Flux Controlnet Hed V3

Hed ControlNet checkpoint specifically designed for the FLUX.1-dev model, for image generation tasks

Image Generation English

Flux Controlnet Depth V3

FLUX.1-dev ControlNet is a deep ControlNet checkpoint developed by Black Forest Labs, suitable for image generation tasks at 1024x1024 resolution.

Image Generation English

Flux Controlnet Canny V3

Canny edge detection control network checkpoint for the FLUX.1-dev model, suitable for 1024x1024 resolution image generation

Image Generation English

AuraFlow v0.3 is a fully open-source flow-based text-to-image generation model that supports multiple aspect ratios, with resolutions up to 1536 pixels.

FLUX.1 Dev IP Adapter

IP adapter for the FLUX.1-dev model, supporting image processing similar to text for text-to-image generation tasks

Text-to-Image English

Flux Controlnet Collections

The FLUX.1-dev ControlNet Collection provides three pre-trained models (Canny edge detection, HED edge-aware, Midas depth map), optimized for 1024x1024 resolution image generation.

Image Generation English

Stable Diffusion XL (SDXL) 1.0 is a powerful text-to-image model capable of generating high-quality images from text descriptions.

Image Generation

Lumina Next SFT Diffusers

Lumina-Next-SFT is a 2-billion-parameter Next-DiT model that uses Gemma-2B as the text encoder and is enhanced through high-quality supervised fine-tuning (SFT) for text-to-image generation.

Pixart 900m 1024 Ft V0.6

A fully fine-tuned image generation model based on ptx0/pixart-900m-1024-ft-large, specializing in high-quality image generation

Image Generation

terminusresearch

Colorful XL is a text-to-image generation model based on stable diffusion technology, capable of producing high-quality and diverse images from text descriptions.

Text-to-Image English

Kohaku XL Epsilon Rev2

A text-to-image generation model based on Amber XL Epsilon rev1, optimized for selected artist works and specific series/game-related images

Image Generation English

Controlnet Canny Sdxl 1.0

A powerful control network model capable of generating high-resolution images with visual quality comparable to Midjourney, achieving precise control through Canny edge detection.

Image Generation

Terminus Xl Velocity V2

A full-rank fine-tuned text-to-image generation model based on terminus-xl-velocity-v1, supporting multiple resolution outputs

Image Generation

Pixart Sigma XL 2 1024 MS

PixArt-Σ is a latent diffusion model based on the Transformer architecture, capable of generating high-resolution images (up to 4K) directly from text prompts.

Image Generation

Envy Arcane Xl 01

A LoRA model fine-tuned based on Stable Diffusion XL 1.0, specializing in generating arcane magic-style fantasy concept art images

Image Generation

An image generation model fine-tuned based on the Stable Diffusion 1.5 framework, optimized for high-resolution output stability and supports multiple aspect ratio image generation

Text-to-Image English

Deliberate 2 is a text-to-image generation model that supports generating images in styles such as general, anime, and art.

Image Generation

Futaall V8 VAE Diffusers

A text-to-image generation model based on stable diffusion technology, capable of producing high-quality images from text descriptions.

Image Generation

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase